AITopics | regret and constraint violation

00295cede6e1600d344b5cd6d9fd4640-Paper-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 07:15:16 GMT

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Genre: Research Report (0.46)

Industry: Energy (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Online Convex Optimization with Stochastic Constraints

Neural Information Processing SystemsMar-17-2026, 17:49:03 GMT

This paper considers online convex optimization (OCO) with stochastic constraints, which generalizes Zinkevich's OCO over a known simple fixed set by introducing multiple stochastic functional constraints that are i.i.d.

artificial intelligence, constraint-based reasoning, proceedings, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.49)

Add feedback

Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm

Neural Information Processing SystemsFeb-18-2026, 00:40:33 GMT

This paper explores the realm of infinite horizon average reward Constrained Markov Decision Processes (CMDPs).

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

ca460332316d6da84b08b9bcf39b687b-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 04:11:42 GMT

artificial intelligence, constraint, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
North America > United States > Pennsylvania (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)

Add feedback

ae95296e27d7f695f891cd26b4f37078-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 20:14:59 GMT

arxiv preprint arxiv, constraint, probability, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan (0.04)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

Add feedback

ProvablyEfficientModel-FreeConstrainedRLwith LinearFunctionApproximation

Neural Information Processing SystemsFeb-9-2026, 02:31:23 GMT

We study the constrained reinforcement learning problem, in which an agent aims tomaximize the expected cumulativereward subject toaconstraint on the expected total value of a utility function. In contrast to existing model-based approaches or model-free methods accompanied with a'simulator', we aim to develop thefirst model-free, simulator-freealgorithm that achieves a sublinear regret and a sublinear constraint violation even inlarge-scale systems.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Ohio > Franklin County > Columbus (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Michigan > Wayne County > Detroit (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

Add feedback

ProvablyEfficientModel-FreeConstrainedRLwith LinearFunctionApproximation

Neural Information Processing SystemsFeb-9-2026, 02:31:19 GMT

We study the constrained reinforcement learning problem, in which an agent aims tomaximize the expected cumulativereward subject toaconstraint on the expected total value of a utility function. In contrast to existing model-based approaches or model-free methods accompanied with a'simulator', we aim to develop thefirst model-free, simulator-freealgorithm that achieves a sublinear regret and a sublinear constraint violation even inlarge-scale systems.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Ohio > Franklin County > Columbus (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Michigan > Wayne County > Detroit (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)

Add feedback

6abba5d8ab1f4f32243e174beb754661-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 18:43:43 GMT

algorithm, algorithm 1, constraint violation, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.72)

Add feedback

SimpleandFastAlgorithmforBinaryIntegerand OnlineLinearProgramming

Neural Information Processing SystemsFeb-8-2026, 18:43:35 GMT

Our algorithm employsonecolumn forsubgradient descent ineach iteration, whereas thedual project subgradient algorithm requires the whole constraint matrix and conducts matrix multiplication in each iteration. In addition, a class of backpressure/max-weight algorithms [25] are developed in the control/queueing literature and the backpressure algorithm can be interpreted from a view of pressuregradient.

algorithm, artificial intelligence, constraint-based reasoning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.49)

Add feedback

Online Convex Optimization with Stochastic Constraints

Neural Information Processing SystemsNov-21-2025, 16:07:31 GMT

This paper considers online convex optimization (OCO) with stochastic constraints, which generalizes Zinkevich's OCO over a known simple fixed set by introducing multiple stochastic functional constraints that are i.i.d.

name change, online convex optimization, stochastic constraint, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.49)

Add feedback

Filters

Collaborating Authors

regret and constraint violation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

00295cede6e1600d344b5cd6d9fd4640-Paper-Conference.pdf

Online Convex Optimization with Stochastic Constraints

Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm

ca460332316d6da84b08b9bcf39b687b-Paper.pdf

ae95296e27d7f695f891cd26b4f37078-Paper.pdf

ProvablyEfficientModel-FreeConstrainedRLwith LinearFunctionApproximation

ProvablyEfficientModel-FreeConstrainedRLwith LinearFunctionApproximation

6abba5d8ab1f4f32243e174beb754661-Supplemental.pdf

SimpleandFastAlgorithmforBinaryIntegerand OnlineLinearProgramming

Online Convex Optimization with Stochastic Constraints